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ABSTRACT 

We present halo model predictions for the expected angular clustering and associated errors from the completed 
Sloan Digital Sky Survey (SDSS) photometric galaxy sample. These results are used to constrain halo model 
parameters under the assumption of a fixed ACDM cosmology using standard Fisher matrix techniques. Given 
the ability of the five-color SDSS photometry to separate galaxies into sub-populations by intrinsic color, we also 
use extensions of the standard halo model formalism to calculate the expected clustering of red and blue galaxy 
sub-populations as a further test of the galaxy evolution included in the semi-analytic methods for populating 
dark matter halos with galaxies. The extremely small sample variance and Poisson errors from the completed 
SDSS survey should result in very impressive constraints (~ 1 — 10%) on the halo model parameters for a simple 
magnitude-limited sample and should provide an extremely useful check on the behavior of current and future N- 
body simulations and semi-analytic techniques. We also show that similar constraints are possible using a narrow 
selection function, as would be possible using photometric redshifts, without making linear assumptions regarding 
the evolution of the underlying power spectra. In both cases, we explore the effects of uncertainty in the selection 
function on the resulting constraints and the degeneracies between various combinations of parameters. 

Subject headings: large scale structure; cosmology; galaxies:halos; galaxies:evolution 



1. INTRODUCTION 

In the standard picture of structure formation, initial pertur- 
bations in the dark matter density collapse into halos (White 
& Rees, 1978; White & Frenk, 1991) in a hierarchical man- 
ner, starting at small mass scales and moving to larger masses 
over time. Simulations of this process have found that these ha- 
los have a self-similar shapes (Navarro, Frenk & White, 1996, 
NFW, hereafter; Moore et al., 1998), provided that one allows 
for some difference in central density for halos as a function of 
mass (Navarro, Frenk & White, 1997). Following this distribu- 
tion of dark matter, we expect the baryonic matter to fall into 
these halos, cool and eventually form galaxies. This process 
can be simulated as well and the results can likewise be mod- 
eled by relatively simple semi-analytic methods (Kauffmann et 
al., 1999; Somerville & Primack, 1999; Benson et al., 2000). In 
addition to determining where galaxies form within a dark mat- 
ter halo, these methods can also give estimates of the morphol- 
ogy, color, and star formation rates for galaxies, under certain 
assumptions. 

The development of these prescriptions for the distribution 
of dark matter and galaxies has allowed for the calculation both 
the real-space two-point functions (Sheth & Jain, 1997; Jing 
et al., 1998; Peacock & Smith 2000) and power spectra (Sel- 
jak, 2000; Scoccimarro et al., 2000; Ma & Fry, 2000), gen- 
erating predictions for galaxy clustering statistics in both the 
linear and nonlinear regimes which are both physically well- 
motivated and quite simple. These treatments can be further 
generalized to accommodate different galaxy sub-population 
clustering (Scranton, 2002; S02, hereafter), allowing for more 
detailed calculations. 

On the data side, the next generation of galaxy surveys, in 
particular the Sloan Digital Sky Survey (SDSS; York et al. 2000; 
Gunn et al. 1998; Fukugita et al. 1996) have recently begun 
producing the anticipated large, rich galaxy catalogs. The ini- 
tial galaxy clustering measurements (Zehavi et al., 2001; Con- 
nolly et al., 2002; Scranton et al., 2002; Gaztanaga 2001) have 



demonstrated not only the remarkable quality of data possible 
with a fully digital large area survey but also the enormous 
statistical power which will be available from such a combi- 
nation of large area and depth of redshift. At the same time, 
the constraints on cosmological parameters (£2m, £2a, CTs, etc.) 
have been improved by recent measurements of the CMB (cf. 
Pryke et al. (2001)) and large scale structure measurements (cf. 
Tegmark et al. (2001)) to the extent that we can reasonably 
consider the details of galaxy clustering and evolution in the 
context of a fixed cosmology. 

The simultaneous development of these powerful tools for 
exploring galaxy clustering and evolution leads one to consider 
the possibilities for testing the predictions from the halo model 
against the future prospects of the data. Toward this end, we 
present calculations of the expected constraints on the halo model 
parameters from measurements of the angular clustering of galax- 
ies for the completed SDSS survey. Since we wish to test not 
only the ability of the halo model to predict general galaxy clus- 
tering but also the modeling of galaxy evolution, we extend the 
general calculation to include clustering of the expected red and 
blue sub-populations. The exploration of sub-population clus- 
tering has been done to a certain extent in the SDSS redshift 
survey (Zehavi et al., 2001). However, many of the halo model 
parameters we will be investigating affect the respective power 
spectra on small scales (k >~ 1/zMpc -1 ), a region where spec- 
troscopic surveys are plagued by redshift-space distortions and 
observational complications (e.g. the collisions between spec- 
troscopic fibers in the SDSS survey). Although these effects 
can be mitigated to a certain extent by projecting the clustering 
along the line of sight, one can achieve similar effects by con- 
sidering the angular clustering in the photometric catalog. This 
approach also offers the benefit of a much deeper look (z ~ 0.3) 
at the clustering for all galaxies than the SDSS main galaxy 
spectroscopic sample (z ~ 0.1) is capable of delivering. 

From the standpoint of the halo model, the importance of 
constraining the halo model parameters with observations is 
two-fold. The various components of the halo model (mass 
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function, concentration, etc.) have all been parameterized in 
various ways and fit to the results of simulations which try to 
replicate the underlying physics as close as is feasible. Our 
approach mirrors those efforts, replacing the simulation input 
with expected measurements on the sky. From the standpoint 
of the halo model as a theoretical construct, this sort of con- 
straint will provide an invaluable check on whether the param- 
eterizations coming out of simulations reasonably correspond 
to the data on the sky. If so, then we have a powerful tool for 
analytic calculations and future insight into the development of 
large scale structure and galaxy evolution. If not, we can isolate 
those parts of the formalism have failed (determining that the 
disagreement between simulation outputs and the observations 
is due to a failure in the assignment of galaxies to halos rather 
than the halo mass function, for instance). This feeds into the 
second aspect of the constraints: improving future simulations. 
The simulations carry greater information about the underlying 
physics than is present in the parameterizations that feed into 
the halo model calculations. Replacing simulation constraints 
on the halo model parameters with ones taken from the data 
should indicate which aspects of the physics that are included 
in the simulations are necessary to generate the observed clus- 
tering and evolution and what might be ignored. 

In §2, we review the basics of the halo model and the aug- 
mentations necessary to calculate sub-population power spec- 
tra, establishing the fiducial set of model parameters we will 
use for all our calculations. With this laid out, §3 discusses the 
two types of selection functions we will use to project the three- 
dimensional galaxy clustering onto the sky. §4 briefly discusses 
the formalism for calculating the angular correlations as well 
as the corresponding covariance matrices and the additional in- 
formation needed to account for uncertainties in the selection 
function. These angular correlations are converted into angular 
biases in §5, with their own attendant covariance matrix. These 
are then fed into the Fisher matrix formalism described in §6 
to produce constraints on each of the parameters. Finally, §7 
discusses the results of these calculations, the expected errors 
and degeneracies on each of the halo model parameters. 

2. THE FIDUCIAL MODEL 

For the purpose of calculating the power spectra that will 
feed into our angular correlations, there are two broad classes 
of parameters we will consider (we leave the details of how the 
parameters combine to produce power spectra to Appendix A). 
First, we have the general halo model parameters which de- 
scribe the overall dark matter halo profile and biasing. These 
parameters have been measured from a number of N-body sim- 
ulations and we will adopt conventional values. 

The second set of parameters describes the halo occupation 
density: the abundance and distribution of galaxies (and galaxy 
sub-populations) in the halos. As mentioned previously, a va- 
riety of semi-analytic techniques have been applied to N-body 
simulations. For the purpose of our calculations, we will use 
the outputs of the GIF simulations (Kauffmann et al., 1999) 
which have been generated using the SDSS magnitudes. These 
particular implementations of the GIF methodology have not 
been the subject of extensive inquiry in the literature, giving 
us more flexibility to explore the parameterization possibilities. 
Further, given the relatively light computational load of cal- 
culating eventual constraints on these parameters, we will al- 
low ourselves a generous parameter space, larger, in fact, than 
could easily be constrained by an actual measurement of the 



angular clustering for red and blue galaxies given simple com- 
putational methods. However, by performing this seemingly 
over-zealous calculation, we can determine which of the pa- 
rameters are well-constrained by the angular correlations and 
which might be fixed without significantly affecting the errors 
on the other more sensitive parameters. 

2.1. Dark Matter Halo Parameters 

Beginning with the general halo parameters, the fundamental 
unit of the halo model is the halo profile. This can be parame- 
terized along the lines of the profile derived by NFW, 
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where r s is the universal scale radius and p s = 2 3+a p(r s ). We 
can replace rs by a concentration, c = r v /rs, where r v is the 
virial radius. This radius is defined as the radius encompassing 
a region within which the fractional overdensity of the halo Ay 
(Eke, et al., 1996) scales as 

A v (z) = 187t 2 (i2 M (zr a5S ), (2) 

where Qm(z) is the matter density relative to the critical density 
for a given redshift, 
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Q.M is the matter density today relative to the critical density, 
and we have chosen a ACDM cosmology where Q.m + Qa = 
1. The concentration is a weak function of halo mass (c = 
c (M/M*f, where c ~ 0(10) and P - -o(10 _1 )). The tradi- 
tional NFW profile gives a = — 1 , while the Moore profile has 
oc= —3/2. We will use a = — 1.3 for the calculations in this pa- 
per, but the general results are largely insensitive to the choice 
of a. Bullock et al. (2001) gives cq = 9 for a pure NFW profile; 
using Peacock & Smith's relation, cq w 4.5 for a Moore pro- 
file. Since we are using an intermediate value of a, we choose 
cq =6 and p = —0.15 for all the calculations in this paper. In 
order to generate power spectra at a variety of redshifts, we also 
scale the concentration as cq <~ (1 + z) _1 for a given redshift z, 
as in Bullock etal. (2001). 

Once we know how the mass in a halo is distributed, we need 
to know how many halos of a given mass we expect to find, 
i.e. the halo mass function. Traditionally, this mass function is 
expressed in terms of a function /(v), 



(4) 



where v relates the minimum spherical over-density that has 
collapsed at a given redshift (8 C ) and the rms spherical fluctua- 
tions containing mass M (o(M)) as 

This can be generalized for an arbitrary redshift by taking the 
forms from Navarro, Frenk and White (1997), 
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and scaling o(M) as o(M,z) = a(M,0)D(z) where D(z) is the 
linear growth factor for a given redshift z normalized to unity 
at z = 0. We define as the mass corresponding to v = 1 . 
The functional form for /(v) is traditionally given by the Press- 
Schechter function (1974). This form tends to over-predict the 
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number of halos below M*, so we use the form found from 
simulations by Sheth and Tormen (1997), 

V/(v)~(l+V , ~V 1/2 *~ v72 ) (?) 
where v' = av, a = 0.707 and p = 0.3. This gives us a total 
of five general halo model parameters we might constrain with 
angular measurements: a, co, p, a and p. 

2.2. Galaxy Parameters 

As described in Appendix A, to calculate the galaxy power 
spectrum, we need to know the mean number of galaxies for 
a given halo mass ((N)(M)) and the second moment of the 
galaxy distribution ( (N(N — 1 )) (M)). Following the example in 
S02, we will generate different power spectra for red and blue 
galaxies by using different (N) relations for each of the sub- 
populations as well as changing the distribution of the galaxies 
within each halo. Before settling on the specific values of the 
respective parameters, we need to establish some necessary for- 
malism. 

Figure 1 shows the measurements of (N) (M) taken from the 
GIF simulations for red and blue galaxies. As in S02, we can 
parameterize the (N) (M) relations as modified power laws. In 
the case of red galaxies, the power law is cut-off by a lower 
mass limit (Mro), 
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For the blue galaxies, the modification is a bit more involved, 
including a Gaussian term to account for low-mass halos with 
a single blue galaxy. Thus, our (N)b(M) relation is 
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FIG. 1 . — (N) (M) for red and blue galaxies as measured at z = 0.27 in the 
GIF simulations. The solid lines indicate the values of (N)r and (A')b using 
the forms in Equations 8 and 9 and parameter values in Table 1 . The errorbars 
in both cases represent the Poisson errors in each mass bin. 



Table 1 
Fiducial model parameters 



Class 



Parameter 



Dark Matter Halo 



Galaxy HOD 




Value 



-1.3 

6 
0.15 
0.707 
0.3 



4.9 x 



50.6 

3 

10 13 h- l M, 
1.1 

l0 u h- l M e 
2.34 x l0 l3 h- l M e 
0.93 
0.65 
6.6 
11.73 
12.1 
0.27 







NOTES. — Galaxy Halo Occupation Density (HOD) parameters 
taken from GIF simulations at z = 0.27. Red galaxies taken to 
have g' — i' > 0.85 in rest frame colors. 



As mentioned above, we must also adopt a form for the second 
moment of the distributions, (N(N - l))(Af). In S02, we fol- 
lowed the results found by Scoccimarro et al. (2001) which re- 
lated (N(N — 1)) to (N) as (N(N-1))(M) = a 2 M (M)(N) 2 (M), 
where 



a M (M) = 
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(10) 

This gives the galaxies the Poisson distribution at large mass 
and sub-Poisson distribution observed in simulations. Since the 
mass dependence of ocm can significantly affect the small scale 
power, we would like to include this effect in our parameter 
constraints. However, to make the form easier to incorporate 
into our later Fisher matrix formalism, we use a sigmoid func- 
tion, 

(log(M)-M„)/a„" 
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where M a and o a are chosen to match the behavior in the GIF 
simulations (fitting to match the behavior in Equation 10 results 
in values of 12 and 0.45 forM a and o a , respectively). 

While Equations 8 and 9 are sufficient to describe the ex- 
pected number of red and blue galaxies in a halo of given mass, 
they do not suffice to match the observation that red and blue 
galaxies have different radial distributions within a given halo, 
with red galaxies tending to populate the halo center and blue 
galaxies the outer regions. To generate different distributions 
for our red and blue galaxies within each halo, we follow the 
method outlined in S02, using profiles of the form given in 
Equation 1 as the distribution functions for the red and blue 
galaxies within each halo. In principle, one could construct 
distribution functions for each of the galaxy sub-populations 
from observations of their relative abundances at a number of 
radii. However, by assuming a form for the distributions, the 
power-law indices and relative normalizations of these distri- 
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butions are effectively determined by considering two param- 
eters: the ratio of blue to red galaxies at large halo radii (r|) 
and the inverse ratio at small radii (/j). Since these ratios re- 
late to the number of galaxies, rather than the mass assigned to 
the galaxies, we have to transform these quantities through our 
(N) relations from Equations 8 and 9 in order to maintain mass 
conservation within the halo. Thus, we define r|' and // as 

i/Tb M b 
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For large radii, the profiles will both scale as p ~ r 3 , so T]' sets 
the relative normalization directly: 
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The choice of radius (r,) for the measurement of /j is somewhat 
arbitrary, so we follow S02 in setting r,-c/r v = 0.1, (where c is 
the halo concentration and r v is the virial radius) for our calcu- 
lations. This gives us the difference in the power law indices 
for the red and blue galaxies (Aoc), 

logO/Tl') 



Aa = ocb - a R = 



(14) 



log(l+r/c/r v )-log(r,-c/r v ) 
With this relation between aR and de in hand, we can perform 
a simple search over values of aR to find the sub-profiles that 
combine to closest match an overall profile with a given value 
of a. Since we know that Aa must be positive, this relation 
guarantees a flatter distribution of blue galaxies in the center 
of halos relative to red galaxies and Equation 13 produces rela- 
tively more blue galaxies in the outer regions. 

With this formalism in place, the only elements missing are 
actual values for the parameters in Equations 8, 9, 11, and 12. 
Since we need the (N) relations to calibrate /j, T], M a and a a , 
we begin with (N)r and (N)q. 

Typically, the (N) relations are determined from simulations 
using very wide or open ended magnitude cuts. However, as 
shown in Figure 2, we can see that the shape of the (N) rela- 
tions can change quite dramatically if we consider only a nar- 
row range in apparent magnitude (like those described later in 
§3), particularly the location of the Gaussian component. As a 
consequence of this complication, any constraints made on the 
parameters in (N)r(M) and (N)b(M) from a given magnitude 
cut will not accurately describe the complete number-mass re- 
lationships. This situation can be salvaged to some extent how- 
ever, based upon the fact that shape of the number-mass surface 
does not appear to vary strongly with redshift (the entire surface 
does shift to fainter magnitudes with increasing redshift, as one 
would expect for apparent magnitudes). Thus, with a volume 
limited sample, one could in principle constrain the entire sur- 
face with a series of magnitude limited measurements. 

Leaving aside these concerns, we can fit the parameters in 
Equations 8 and 9 for the magnitude cut in §4 (20 < r 1 < 21) 
for a typical peak redshift in the selection functions at z ~ 0.3. 
The results of these fits are shown in Table 1; a comparison 
of the fits to the outputs of the simulations and a discussion of 
the color cut used are given in Appendix B. The parameters 
listed show a significant difference from previous similar fits 
(e.g Scoccimarro et al. (2001)). This is not surprising, how- 
ever, since our fits are made at higher redshift, resulting in a 






FIG. 2. — (N) as a function of halo virial mass and r' magnitude from the 
GIF simulations at z = 0.06. From top to bottom, the panels give the surfaces 
for all galaxies, blue galaxies (rest frame g' — ;' < 0.85) and red galaxies (rest 
frame g 1 — i' > 0.85), respectively. Contours correspond to a change in 0.25 
for log ({A')). Filled contours indicate log({/V)) > and wire-frame contours 
indicate log({/V)) < 0. 
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FIG. 3. — Comparison of fraction of red and blue galaxies as a function 
of rc/r v . The lines give the distributions predicted from the sub-population 
distributions discussed in §2.2 and the points show the distribution of galaxies 
from the GIF simulations. For the simulated data, red galaxies were taken to 
have g' — i! > 0.85 in the rest frame (see Appendix B for more details). Error 
bars represent the Poisson error for all of the galaxies in a given radial bin, 
scaled appropriately for the fractional comparison. 



correspondingly higher mass scale for all of the (N) relations. 
Likewise, the narrow magnitude cut leads to a relative suppres- 
sion of the (N) relations at smaller halo mass giving larger val- 
ues of ye than have been reported previously. It is interesting, 
however, that the combination of magnitude and redshift did 
not significantly change the (N(N — 1)) mass scale, allowing us 
to use similar values in Equation 1 1 as those matching the form 
of Equation 10. At this redshift, we also observe a higher value 
of /j in the inner regions of the halo, although the value of r| 
is largely unchanged from the z — value in S02. To test this 
set of parameters, we can compare the distribution of red and 
blue galaxies in the simulation to that which we would predict 
from our model. As we can see in Figure 3, this combination of 
parameters reconstruct the halo galaxy distribution observed in 
the simulations reasonably well. 

Putting the parameters from Table 1 into our prescription for 
calculating the power spectra leads to the curves shown in Fig- 
ure 4 for z = 0.3. As with the z = power spectra in S02, there 
is significant biasing in the red galaxy sample and anti-biasing 
in the blue galaxies. Likewise, there is a similar (but weaker) 
large k break in the blue power spectrum. The power spec- 
trum for the whole galaxy sample shows a slight break around 
k ~ 10/iMpc -1 which is not seen in the z = case or in low 
redshift observations (cf. Hamilton et al., 2000). At this small 
scale, the power spectra are dominated by the smallest mass 
halos which, in turn, are almost exclusively populated by blue 
galaxies, particularly at higher redshifts. This suggests that we 
should not be surprised to find a break in the power law behav- 
ior of wxx(8) for a photometric redshift-selected galaxy sam- 
ple. 

3. REDSHIFT DISTRIBUTION 



Table 2 

Redshift distribution parameters for a simple 
magnitude cut and narrow window distribution. 



Distribution 


Galaxy Type 


a 


zo 


b 


Magnitude Cut 


Early 


2.6 


0.37 


3.3 




Intermediate 


2.0 


0.37 


2.45 




Late 


0.98 


0.4 


2.28 




All 


1.28 


0.42 


2.94 


Narrow Window 


Early 


12 


0.35 


15 




Intermediate 


12 


0.3 


15 




Late 


12 


0.3 


15 




All 


12 


0.32 


15 



To project the three dimensional power spectrum onto the sky 
we need to know the redshift distributions of the red and blue 
galaxies in our sample. In order to separate the galaxies into red 
and blue classes, we would need to have photometric redshift 
information available on each galaxy, meaning that we could, 
in principle, choose the galaxy sample to match any selection 
function we desired within the bounds of the survey limits. Al- 
ternatively, we could use the redshift distribution resulting from 
a simple magnitude cut. The magnitude cut results in a some- 
what broad distribution of redshifts, so we will use the arbitrary 
redshift distribution to explore the effects of a much narrower 
redshift window. In either case, however, we are limited by 
the ability of the photometric redshift calculations to cleanly 
separate our galaxy sample into red and blue types, which, in 
turn, is limited by the photometric errors at a given magnitude. 
Based on early results from the photometric redshift work on 
the SDSS, the practical limit for reliable redshifts is r* w 21. 

For the case of a simple magnitude cut, we can follow the 
method used by Dodelson et al (2001) in modifying the redshift 
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FIG. 4. — Power spectra at z = 0.3 for red (PRR(k)) and blue (Pbb (k)) galax- 
ies compared to the linear dark matter (Pun{k)) an d galaxy (Paa(k)) power 
spectra. 
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FIG. 5. — The top panel gives the normalized redshift distributions for all 
galaxies with 20 < r' < 2 1 and the red and blue sub-populations, as modified 
for SDSS colors from the CNOC2 survey. The bottom panel gives an example 
of narrow redshift distributions possible for a sample of galaxies using photo- 
metric redshifts. 



distributions found from the CNOC2 survey (Lin et al, 2000) to 
match the SDSS filters. The redshift distributions for red and 
blue galaxies (taken from Lin's early type and the combination 
of intermediate and late types, respectively) can be seen in the 
left panel of Figure 5. Since these distribution functions are 
taken from the morphological types rather than the color cut 
mentioned in Table 1 and Appendix B, we do not expect these 
distributions to exactly match those found in the final SDSS 
data, but they should give us a reasonable approximation. The 
shape of these distributions can be well approximated by a func- 
tion of the form 

^|Wexp(-(z/zo) 6 ). (15) 

The fits to the parameters in this function for red and blue galax- 
ies with 20 < r 1 < 21 are given in Table 2. We also can use 
this form to specify the shape of the narrow redshift distribu- 
tion, adopting the second set of parameters in Table 2, result- 
ing in the redshift distributions shown in the right panel of Fig- 
ure 5. By design, these artificial distributions have roughly the 
same peak in redshift as the magnitude cut distributions, which 
should make the eventual comparison independent of evolution 
effects in the power spectrum. In addition, they all avoid the re- 
gion around z ~ 0.4 as much as possible. This is particularly 
important due to the limitations of the photometric redshifts 
available in with the SDSS filters; there is a degeneracy in the 
colors for early type galaxies near a redshift of 0.4 resulting in 
large uncertainty in the photometric redshift for these objects. 

4. CALCULATING w(0) AND COVARIANCE MATRICES 

4.1. Limber's Equation 

With the power spectra and selection functions in hand, we 
can calculate the expected angular correlations for the red and 
blue galaxies (wrr(9) and wbb(6), respectively) using Lim- 
ber's equation: 

wxx(e) = ^j dkk j d%F x (x)P xx (k,x)MkBx) (16) 



where % is the comoving angular diameter distance and Jq is 
the Bessel function. We could also consider the angular cross- 
correlation between red and blue galaxies, but since we will be 
re-casting the angular correlations in terms of relative biases in 
§5, the cross-correlation will not yield any additional informa- 
tion. The normalized redshift distributions, Fr(%) and Fb(%), 
are given by 

1 dNx dz ' 



MX) = 



Nx dz d% 



where 



^ = Ho + z) 3 + a K (1 + z f + Q A ] 



1/2 



(17) 



(18) 



for a given Hubble constant (//o), matter density (£2m), cosmo- 
logical constant (£2a) and curvature {Q.k = 1 — £Im — ^a)- As 
mentioned previously, we will use a flat ACDM model (£1« = 
0.3, Q.a = 0.7, h — 0.7) and normalize the linear power spec- 
trum such that Og = 0.9 and n$ = 1. 

We can simplify Equation 16 by assuming a linear scaling re- 
lation for the power spectrum, Pxx{k,%) — D 2 (%)Pxx(k), where 
we choose Pxx(k) to have the shape of the power spectrum at 
the peak redshift of Fx(%)- Making this substitution gives us 

wxx(B)=4n 2 jdkkP X x(k) j d%Fx(x)D 2 {x)Jo{kBx). (19) 

The results of performing this calculation for the three power 
spectra and the two sets of selection function are shown in 
Figure 6. In addition to the assumption of linear power spec- 
trum scaling, we also truncate our integral over wavenumber at 
k = lOO/iMpc -1 ; this does not change the calculated values of 
wxx(9) d ue to tne f at sna P e of the kernel at small kQ and allows 
us to avoid any complication of the power spectrum shape due 
to baryon concentration in the innermost regions of the halo. 

The price we pay for the assumption of linear power spec- 
trum scaling is the neglect of any change in the shape of the 
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power spectrum at large A; as a function of redshift due to non- 
linear effects. For the photometric redshift selection functions, 
this is not a serious problem, but the width of the magnitude- 
based selection function gives one pause, particularly when con- 
sidering the very small expected errors (§4.2). However, given 
the nature of the surfaces in Figure 2, it is clear that any changes 
in the shape of the power spectrum we might see will be a 
reflection of the changing (N) relations as a function of red- 
shift. This makes our choice of fiducial model for (N)r and 
(N)b somewhat nebulous. We can see the effects of calculat- 
ing wxx(8) with and without the linear approximation for the 
two selection functions in Figure 7. As expected, the photo- 
metric redshift selection function shows little to no effect even 
when we calculate the power spectrum using the full (N) and 
(N(N — 1)) surfaces. For the magnitude selection selection 
function, the results are identical when we hold the (N) rela- 
tions the same, but show strong deviations if we use the (N) 
and (N(N — 1)) surfaces to calculate wxx(9) without the linear 
approximation. Fortunately, as we will see in §7, the equiva- 
lent constraints obtained with the photometric redshift selection 
function make this concern moot. 

4.2. Covariance Matrices 

Since we will need to account for the fact that the angular 
bins in our measurement are going to be highly correlated, we 
must deal with the full covariance matrix and not just the ex- 
pected errors on each bin. For the expected Gaussian and non- 
Gaussian sample variance covariance matrices, we follow the 



10 fe- 

<£> 10 
10"' k- 




Linear evolution, fixed (N) 
Nonlinear evolution, fixed (N) 
Linear evolution, <N> surface 
Nonlinear evolution, <N> surface 



0.001 



0.01 



0.1 

(degrees) 



10 



FIG. 7. — Angular correlations for all galaxies (»'gg(6)) under a number of 
assumptions for power spectrum evolution. The black and red curves show the 
angular correlations (and associated errors) which have been calculated using 
the (A')r, (A^b and relations given by Equations 8, 9 and 1 1 (respectively) 
and the parameter values in Table 1 . The black curve assumes the linear power 
spectrum evolution in Equation 19, while the red curve calculates m>gg(6) us- 
ing Equation 16. The blue and cyan curves take their (N) and (N(N — 1)) 
relations taken directly from surfaces like those shown in Figure 2 (shifted in 
magnitude appropriately for a given redshift) to calculate the power spectrum 
under linear and non-linear assumptions of Equations 19 and 16, respectively. 
The upper panel shows the angular correlations for the magnitude-based selec- 
tion function and the lower for the photometric redshift selection function. 



prescription given in Eisenstein & Zaldarriaga (2001) (as well 
as similar treatments in Cooray & Hu (2001) and Scoccimarro, 
et al. (1999)). The sample variance (Cj(9,9')) consists of two 
parts, a component dependent only on the two-point angular 
correlation function (the Gaussian covariance) and a second 
piece which is a function of the four-point angular correlation 
(the non-Gaussian covariance): 

c 5 (e,e') = (we) - w(e))(w(e') - *>(e'))) 

= c G (e,e')+c TO (e,e'). (20) 

In order to make the calculation simpler, we can re- write Equa- 
tion 19 in terms of the angular power spectrum (9xx{K)), 

/KdK 
—Vxx(K)J (Ke), (21) 

where P xx (K) is given in angular wavenumber space (K = k%) 
and 



¥ XX (K) = - I clkPxx(k)FxiX) 



X 2 



Pxx{k). (22) 



Using this formulation, we can write the Gaussian covariance 
matrix (C G (9,9')) as 
1 



c G (e,e') = 



KAc 



dKKPMKyoiKfyJoiKd'), (23) 



where the area of the survey (Aq) is K steradians in the case of 
the SDSS. 

To calculate the non-Gaussian component, we need to gen- 
erate an estimate of the trispectrum for our halo model. For- 
tunately, as Cooray and Hu (2001) indicate, the majority of the 
non-Gaussian covariance can be accounted for by merely calcu- 
lating the single halo contribution of the trispectrum. This term, 
which is independent of configuration under the assumption of 
spherical halos, is given by 

T^hMMM) = (24) 



(4) 

ii 4 Jo J{ ' M(v) 



(AO 



\y{h,M)\\y{k 2 ,M)\\y{k 3 ,M)\\y{kA,M)\cN, 



where (A^)' 4 ' = <X A M {N) A under the assumptions in Equation 11. 
As with the case of the Gaussian component, we need to project 
the trispectrum into angular wavenumber space, 



T 4X (K U K 2 ,K 3 ,K 4 ) = / d% 



T 4x {k\,k 2 MM)- (25) 



Finally, the angular trispectrum needs to be averaged over an 
annulus in angular wavenumber space, 



■Txx^i,^) 



d 2 K y f d 2 K 2 



T 4 x(K h K h K 2 ,K 2 ), (26) 



Ay J Ay 

where A r is the area of the annulus. In the limit of narrow bin- 
ning, we can approximate Txx(K\ ,K 2 ) by calculating 



Txxihto) 



n 4 Jo 



(A0 (4) 9 

f(v))^y 2 (kuM)y 2 (k 2 ,M)dv, (27) 



and appropriately transforming into angular wavenumber space. 
With this in hand, we can calculate the non-Gaussian compo- 
nent of the sample variance using 

Qv- G (8,e')= (28) 

J dKK j dK'K'<Txx(K,K')J (Ke)J (K'e'). 

As the final component of the statistical error, we can add a 
Poisson term (C/>(9,9')) to the diagonal elements, 

An s 



Cp(9,8') 



>q.q> 



(29) 
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FIG. 8. — Variations in the redshift distribution function errors. The top 
panel gives the variations for the magnitude-based selection function and the 
bottom for the photometric redshift selection function. 



where N is the total number of galaxies, 8i2 is the area of the 
angular bin and 89 9/ is the standard Kronecker delta. For the 
full photometric survey, the SDSS will contain on order 200 
million objects. We can scale this appropriately for the 20 < 
r' < 21 magnitude cut from §3, using 50 million galaxies for 
the full sample and half that for each of the sub-samples. For 
the photometric redshift selection function, we can combine it 
with the magnitude selection function, resulting in 8 million 
galaxies within the redshift and magnitude ranges. 

In addition to the statistical errors due to sample variance 
and shot noise, we need to consider the errors in our calcu- 
lation of the expected angular correlations due to uncertainty 
in the underlying redshift distribution. Indeed, in §7, we will 
see that, given the relatively large area observed and number of 
galaxies which will be in the final SDSS data set, the dominant 
source of error in our final constraints will come from the error 
in the redshift distributions. To model the errors in the selec- 
tion function, we follow the treatment given in Dodelson et al. 
(2001). Since we do not have the exact errors for the param- 
eter fits to the eventual redshift distributions, we will consider 
errors on those parameters of 1, 3, 5 and 10 percent. Drawing 
from 50,000 Monte Carlo realizations of the redshift distribu- 
tions with these variations, we can calculate the covariance of 
the expected angular correlation functions (Cz(8,8')) and use 
this to approximate the covariance from the redshift distribu- 
tion uncertainty. Figure 8 shows the average deviation from the 
magnitude cut redshift distribution for each of the parameter 
variation levels. Bringing these three pieces together gives us 
our final covariance matrix (C(9,9')) 

C = C g + C ng + Cp + C z . (30) 
Figure 9 shows the error in wgg(9) due to each term in the 
covariance matrix as a function of angle. In the absence of se- 
lection function errors, the errors on the very smallest angu- 
lar scales are dominated by shot noise, giving way to Gaussian 
sample variance at larger angles. Non-Gaussian sample vari- 
ance is small enough to ignore even at small angles. When 



the errors due to uncertainties in the selection function are in- 
cluded, they quickly dominate the statistical errors on small an- 
gular scales in all but the most modest error regime. 

5. RELATIVE BIASES 

In principle, the angular correlations we have calculated so 
far are sufficient to calculate the parameter constraints. How- 
ever, the range of values that wxx(9) takes on as a function of 
can be a problem for numerical derivatives. To avoid that, 
we can instead use the angular biases given by the ratios of the 
angular correlations, 



'4b (9) 



^Bo(e) = 



wrr(9) 
wbb(8) 
wrr(9) 
wgg(9) 
wbb (9) 



(31) 



wgg(9) ' 

where wgg(9) is the angular correlation for all of the galaxies. 
As Figure 10 shows, this switch not only decreases the absolute 
range of we must consider, but also gives us more features in 
the curves to help determine the parameters. The price that we 
pay for this improvement is an additional step in the calculation 
of the covariance matrix we will use later in §6. Likewise, the 
larger amplitude wxx(9)'s for the photometric redshift based 
selection function (and commensurate larger values in the co- 
variance matrix) results in proportionally larger errors on the 
resulting relative biases. 

In order to properly determine the errors on the biases, we 
need to take into consideration the correlations between the an- 
gular bins in w\x (9) as indicated by each correlation function's 
covariance matrix. To do this, we decompose the covariance 
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FIG. 9. — Statistical and selection function contributions to the error on 
wgg(6) for the magnitude-based selection function. For the photometric red- 
shift selection function, the results are similar for the Poisson, Gaussian and 
selection function errors. The contribution from non-Gaussian sample error 
is relatively stronger for the photometric selection function, but still remains 
small enough to safely ignore in the total covariance matrix. 
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matrix (C) into its eigenmodes, creating a basis (R) where each 
of the modes is independent and the variance on that mode is 
given by the associated eigenvalue (c,-). We can project wxx(0) 
into that basis as 

wxx(e)' = R r w X x(e), (32) 
where R T is the transpose of R. Since we require that the co- 
variance matrix be positive definite, we set any c, which is nega- 
tive due to numerical errors to zero and remove that mode from 
r'. Within this basis, we can create a set of Monte Carlo realiza- 
tions of each vvxx(0)' which, when transformed back into the 
angular basis, will have the correct covariance. Thus, the mean 
ratios of these realizations will give us the values for Equa- 
tion 32 with the proper correlation between angular bins and 
the covariance between these realizations gives us the correct 
errors on our relative biases. 

6. FISHER MATRIX CALCULATION 

To estimate the expected errors on the parameters in Table 1 , 
we can use the standard Fisher matrix formalism. Choosing 
a fiducial set of parameters leads to a reference angular bias, 
b 2 (Q). We can approximate the likelihood for some variation of 
the parameters as 



n — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — T 



1 



- exp 



-i8(e)c B 1 (e,e')5(e') 



(33) 



where N is the number of angular bins, Cb(0, 0') is the covari- 
ance matrix for the angular biases from the Monte Carlo cal- 
culations in §5, Cg is the determinant of Cb(0,0') and 8(0) = 
{b 2 (Q) — o 2 (8)). The Fisher matrix is related to the likelihood 
function as 

/ dlnL \ 

Fotp = - ( > , (34) 

and (F~ 1 ) a p gives us the covariance between parameters a and 
P marginalized over all other parameters, while (F a p) _1 gives 
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FIG. 10. — Relative angular biases b^ B (Q), b^ a (ff) and ^| G (9) for the 
magnitude-based selection function (upper panel) and the photometric redshift 
selection function (lower panel). 




FIG. 11. — Marginalized and unmarginalized errors (dx and S.t, respec- 
tively) for each parameter in each of the selection function error regimes for 
the magnitude-based selection function. 



us the covariance without marginalizing over all other param- 
eters. Plugging our likelihood function from Equation 33 into 
Equation 34 gives us the Fisher matrix in terms of first deriva- 
tives, 



H a(3 



db 2 (d) 

dx a 



^'(0,0' 



db 2 {&) 
3xp 



(35) 



In principle, only two of the relative biases given in Equa- 
tion 32 would be necessary to constrain the halo model param- 
eters. However, given the amplitude of the errors when the ef- 
fects of errors in the selection function are included (particu- 
larly in the 10% case), we find a more stable solution when us- 
ing all three relative biases. This makes our b 2 (Q) a concatena- 
tion of £> RB (0), ^rg(^) an d ^bg(^) an d requires us to calculate 
a joint covariance matrix for all biases. In order to eliminate 
the presence of singular modes in this joint matrix, we only 
consider those angular bins from £>| B (0), ^rg(9) an d ^bg(^) 
which have errors less than the amplitude of the bias in all three 
measurements, reducing our number of angular bins by about 
one third. The addition of the third relative bias does result in 
some degenerate modes in our covariance matrix, but a standard 
singular value decomposition routine can handle these modes 
adequately, resulting in numerically identical Fisher matrices 
for either 2 or 3 relative biases in the small selection function 
error cases. 

In addition to these concerns, there is also the need to use a 
sufficient number of Monte Carlo realizations to ensure that the 
statistical noise from the realization (which goes as 8 ~ 1 /\/N 
for N realizations) is sufficiently low to allow for an accurate 
numerical calculation of the derivatives in Equation 35. To meet 
this requirement, we used one million Monte Carlo realizations 
for each derivative calculation, achieving a statistical noise in 
the derivatives on order 0.1%. Given this level of precision, 
we calculated the derivatives using centered derivatives with 
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FIG. 12. — Same as Figure 11, but for the photometric redshift selection 
function. 



a typical step size (Ax/x = 8 1 / 3 ) of 10% for each parameter. 
Given the much larger absolute value of Mr, Mb, and Mro than 
the remainder of our parameters, we calculated the estimated 
constraint on the logarithm of each of these parameters, rather 
than the full value. Because of this transformation we reduced 
the step size for these parameters (as well as Mb s and M a ) to 
5%. The accuracy of the resulting Fisher matrix was verified by 
comparison with a Fisher matrix calculated from the estimates 
of the noise in each derivative, typically resulting in a noise on 
the diagonal of the Fisher matrix of less than half a percent. 

7. RESULTS 

As with any such calculation, there are essentially two ques- 
tions to be addressed: what is the expected magnitude of the er- 
rors on each parameter and what are the expected degeneracies 
between the various parameters. We will take these questions 
in turn. 

7.1. Error Magnitudes 

Figures 11 and 12 give the fractional errors (dx/x and 8x/x) 
for each of the selection functions and selection function error 
regimes, where we take dx to be the marginalized error on pa- 
rameter x, and 8x to be the unmarginalized error: 

(dx) 2 ee (F~% (36) 

(5x) 2 ^(F A ,r 1 . 
The immediately striking aspect of each of these plots is the 
minuscule expected error in log(Me), log(MR) and log(MRo), 
particularly in the right panel. This is to be expected, how- 
ever, since these are the fractional errors in logarithmic quanti- 
ties. Translated to errors on the actual mass scales, these cor- 
respond to roughly 1 % marginalized errors for both selection 
functions in the absence of selection function error. Addition- 
ally, we can see that, while the fractional errors are less impres- 



sive for the magnitude-based selection function, the errors us- 
ing this selection function are more robust against errors in the 
selection function. In general, we can see that there is not an 
enormous difference in expected fractional error between the 
two selection functions for most parameters. The magnitude- 
based selection function appears to be marginally more robust 
against increasing selection function error. This probably does 
not compensate for the short-comings of making the linear as- 
sumption shown in Figure 7, but is worth bearing in mind if 
there is significant error in the photometric redshift (e.g. large 
errors due to color degeneracies). 

For the general halo model parameters, the expected errors 
are quite small. Indeed, given the disparity between values of 
a for the NFW and Moore profiles, as well as the scatter on 
the concentration parameter listed in Bullock et al. (2001), it 
is possible that the expected errors on these parameters could 
be smaller than the associated errors from simulations. For the 
sub-population parameters, the expected errors on r| and /j are 
larger than the Poisson errors on these parameters from the GIF 
simulations. However, they should be sufficient to determine 
that they are different from unity, and hence the usefulness of 
the formalism developed in §2.2. The shape of the Gaussian 
component (as given by A and Aq) in (N)b is not as well de- 
termined by the angular correlations as by the simulations, but 
the errors on the mass scales and power law indices (Mr, Mb, 
Mro, Mb s , Yr and Yb) should be. Finally, the mass scale for the 
deviation from a Poisson distribution in the (N(N — 1)) relation 
(M a ) should be constrained as well from the angular correla- 
tions as in the simulations, but the rate of that deviation (o a ) is 
not. As in the case of A and Aq, this constraint may improve 
given a different set of (N) parameters. 

7.2. Parameter Degeneracies 
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FIG. 13. — Expected error ellipses for four combinations of parameters, nor- 
malized by the respective parameter values, for the magnitude-based selection 
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Similar results were obtained for the photometric redshift selection function, 
with the exception of the /i vs T| plot, which was nearly orthogonal. 
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There are two means by which we can examine the degenera- 
cies between the various halo model parameters. First, we can 
look at the error ellipses between various parameters we expect 
to be correlated (e.g. p & T|, cq & p, M a & a a ). While useful for 
considering particular pairs of parameters, this approach does 
not reveal the full extent of the correlations between all the pa- 
rameters. To examine this, we can decompose the Fisher matrix 
into its eigenvectors. Provided the Fisher matrix is not singular, 
these eigenvectors define a basis of orthogonal combinations of 
parameters and tell us which combinations of halo parameters 
are naturally constrained by the angular correlations. 

The error ellipse for each pair of parameters can be con- 
structed taking the four corresponding elements of F, invert- 
ing and decomposing the resulting matrix into its eigencompo- 
nents. This effectively fixes all of the other parameters in F, re- 
sulting in errors on each parameter on order those found by tak- 
ing (F xv ) -1 / 2 . Figure 13 gives the expected error ellipses (nor- 
malized by the respective parameter values) for the three com- 
binations of parameters listed above, as well as the expected er- 
ror ellipse for a & p, in the limit of a perfect magnitude-based 
selection function. In all four cases, as with the other 149 com- 
binations of parameters, the shape of the error ellipse is not 
discernibly different for the two selection functions, although 
the size will vary between selection functions according to the 
relative values of 8x for the two selection functions. As we can 
see, our model breaks most of the degeneracy between p and r| 
seen in the model used in S02, resulting in a nearly orthogonal 
error ellipse. In contrast, we can see rather strong correlations 
between the expected errors for our concentration relation. 

In addition to looking at correlations between parameters, we 
can also look at how those correlations vary as we increase the 
contribution to our error calculations from the selection func- 
tion. For the case of M a & o a , we can see from Figure 14 
that, along with the expected increase in the size of the error 
ellipse, there is also some degree of wavering in the degree of 
correlation in the various error regimes. In general, we find that 
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this behavior is fairly consistent for all of the various parame- 
ter combinations, with the exception of the 10% error regime. 
In this case, where the covariance on the angular bias is most 
strongly influenced by the selection function error, there are a 
number of parameter combinations (Mr and Mb, for instance) 
where the 10% error ellipses were significantly rotated from the 
other ellipses, as much as 90 degrees in some cases. Likewise, 
the behavior of the 10% ellipse for a given pair of parameters 
for the magnitude-based selection function appeared to be of 
little use in predicting the orientation of the same error regime 
in the photometric redshift selection function. 

Having figuratively tested the waters with the error ellipses, 
we can move on to the more daunting task of examining the 
eigenbases for the various Fisher matrices to determine the pa- 
rameter combinations that our measurements best constrain. Ta- 
bles 3 and 4 give the eigenbases for the two selection functions 
in the limit of no selection function error. In both cases, we have 
ignored those parameters in each eigenmode which contribute 
less than 2% to the total amplitude of the mode (i.e. all those 
parameters whose eigenmode coefficient e,- was |e, | < 0.14). As 
one might have guessed, there are relatively few modes that are 
simply determined by one or two halo parameters. The excep- 
tions to this rule occur for the modes with the best and worst 
constraints, where we find strong constraints on simple combi- 
nations of a few parameters (Yr, a and p) contrasted with rel- 
atively weaker constraints on single parameters (co, t|, Aq and 
p). Not surprisingly, members of this second set tend to also 
have the worst fractional unmarginalized errors and vice versa 
for the elements of the first set. 

Likewise, we can see some reflection of the error ellipses in 
the eigenbasis. The parameter pairs p & r| and a & p were very 
nearly uncorrected in Figure 13 and we can see that these com- 
binations of parameters do not appear in any of the eigenmodes 
for either of the Fisher matrices. Conversely, the M a & a a pan- 
almost invariably appear together in the eigenmodes and with 
the expected relative signs and amplitudes. This is not a perfect 
guide, however, given the correlation between co and p in the 
error ellipses and the absence of an eigenmode containing both 
parameters. 

When we add in errors due to selection function uncertainty, 
the effects are similar to those seen in the error ellipses. The 
eigenmodes remain mostly unchanged with increasing selec- 
tion function error; the contribution of each parameter to a given 
eigenmode remains identical within ~ 2% of the total eigen- 
mode amplitude and appears to preserve the relative signs. Like- 
wise, we see an increase in the error on each eigenmode as the 
selection function error increases; errors increase by a factor 
of 2 for each 5% of selection function error. The one case of 
significant change in the eigenbasis we do see is in the best 
constrained eigenmodes; the degeneracy between Yr, a and p 
changes for both selection functions, leading to an idependent 
mode for a in the magnitude-base selection function and inde- 
pendent modes for both Yr and a in the photometric case. 

8. CONCLUSION 

As hoped, the eventual angular clustering measurements for 
red and blue galaxies should provide strong constraints (~ 1 — 
10%) on a wide variety of halo model parameters. This remains 
the case for moderate levels of uncertainty in the selection func- 
tion. In the limit of small selection function errors, we can 
achieve similar constraints using both a magnitude and photo- 
metric redshift based selection functions. This second approach 



12 



Scranton 



also allows us to have much greater confidence that we are con- 
straining a simple (N) relation, rather than a weighted projec- 
tion over the (N) surfaces in Figure 2. In either case, we can see 
that the parameters relating to the shape of the Gaussian part of 
(N)b do not strongly affect the angular clustering, but the mass 
scales and power law indices in (N)r & (N)b should be very 
tightly constrained. In addition, we should have sufficient con- 
straints on the parameters relating the halo mass function, pro- 
file, and concentration to determine if the values measured from 
simulations are consistent with observations. Finally, the com- 
pleted SDSS measurements should be sufficient to determine if 
the method for distributing red and blue galaxies within a halo 
given in S02 accurately reproduces the observed clustering. 

In addition to the angular correlation function work presented 
here, there are a number of other measurements one might adopt 
to constrain halo parameters in a similar fashion to the method- 
ology presented here. In particular, measurements of both strong 
and weak lensing should provide a great deal of information 
about the structure of dark matter halos and the subsidiary in- 
formation in rich galaxy surveys should be sufficient to probe 
different galaxy biasing by type and color on physical scales, 
rather than the angular ones presented here. Likewise, one 
could readily construct alternative formulations of the power 
spectrum (weighting by star formation rate or bulge-to-disk ra- 
tios, for instance) which could be used in conjunction with 
spectroscopic and morphological information to inform future 
simulations. Deeper spectroscopic surveys could also attempt 
to find the break in the galaxies power spectrum shown in Fig- 
ure 4 as the fraction of galaxies shifts toward the blue at higher 
redshifts. Finally, one could also consider the clustering of 
galaxy clusters as a separate test of the linear behavior of the 
halo concentration and mass function. 
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Table 3 

Fisher eigenmodes for magnitude-based selection function. 



Parameter Basis Expected Error 



0.32y R + 0.84a + 0.43p 


±1.13 x 10~ 4 


0.55y R - 0.53a + 0.62p 


±1.85 x 10~ 4 


0.77y R - 0.63p 


1 

±6.79 x 10~ 4 


-0.371og(M B ) - 0.191og(M R0 ) + 0.53y B - 0.28A + 0.55M Bs + 0.27a+ 0.16p 


±8.99 x 10~ 4 


0.241og(M R ) + 0.21y B - 0.19A + 0.90M a 


±2.29 x 10~ 3 


0.461og(M R ) + 0.391og(M R0 ) - 0.30y B + 0.24M Bs + 0.48o a + 0.40a - 0.26a 


±2.88 x 10~ 3 


-0.751og(M R0 ) - 0.27y B + 0.24M Bs + 0.18M a + 0.36o a + 0.28/? 


±3.75 x 10~ 3 


-0.671og(M R ) - 0.18y B + 0.23M Bs + 0.33M a + 0.27o a - 0.27a - 0.43/? 


±4.37 x 10~ 3 


0.341og(M R ) + 0.311og(M R0 ) + 0.19y B + 0.38M Bs - 0.73a+ 0.21/? 


±4.92 x 10~ 3 


-0.321og(M R ) + 0.331og(M RO ) + 0.20y B + 0.29A + 0.37o a + 0.66/? 


±5.82 x 10~ 3 


0.411og(M B ) + 0.62y B + 0.29A - 0.33M Bs + 0.33o a - 0.32/? 


±8.21 x 10~ 3 


0.481og(M B ) + 0.23M Bs - 0.19o a + 0.79c 


±1.66 x 10~ 2 


-0.551og(M B ) - 0.36M Bs + 0.31o a - 0.31a - 0.20/? + 0.52c 


±2.04 x 10~ 2 


-0.311og(M B ) + 0.80A + 0.26M Bs - 0.35a a - 0.18/? 


±3.37 x 10~ 2 


0.97r) + 0.20c 


±0.11 


A 


±0.61 


A< 


±1.82 



Table 4 

Fisher eigenmodes for photometric redshift selection function. 



Parameter Basis Expected Error 



0.85y R - 0.52a 


±1.44 x 10~ 4 


0.52y R + 0.85a 


±2.76 x 10~ 4 


P 


±4.89 x 10~ 4 


-0.291og(M B ) - 0.171og(M R0 ) + 0.40y B - 0.18A + 0.35M Bs + 0.20o a + 0.71a 


±8.23 x 10~ 4 


-0.301og(M B ) - 0.181og(M R0 ) + 0.42y B - 0.24A + 0.41M Bs - 0.66a 


±1.15 x 10~ 3 


0.281og(M R0 ) - 0.14y B + 0.26M Bs - 0.21M a + 0.83o a - 0.17a - 0.15/? 


±2.28 x 10~ 3 


0.161og(M R ) + 0.25y B - 0.29M Bs + 0.80M a + 0.37o a 


±3.38 x 10~ 3 


-0.151og(M B ) + 0.681og(M R ) - 0.421og(M R0 ) - 0.30y B + 0.23M Bs + 0.42/? 


±3.91 x 10~ 3 


0.251og(M B ) + 0.181og(M R ) - 0.34y B - 0.39A + 0.46M Bs + 0.41M a - 0.17o a - 0.46/? 


±4.23 x 10~ 3 


0.561og(M B ) - 0.191og(M R ) + 0.191og(M R0 ) + 0.16y B - 0.22A + 0.27M Bs + 0.68/? 


±5.19 x 10~ 3 


0.321og(M R ) + 0.501og(M R0 ) + 0.25y B + 0.59A + 0.33M Bs + 0.18M a - 0.24o a 


±7.54 x 10~ 3 


-0.451og(M R ) - 0.491og(M R0 ) - 0.22y B + 0.53A + 0.31M Bs + 0.29M a 


±8.87 x 10~ 3 


0.601og(M B ) + 0.331og(M R ) - 0.361og(M R0 ) + 0.48y B + 0.18A - 0.32/? 


±1.42 x 10~ 2 


co 


±2.76 x 10~ 2 


il 


±0.16 


A 


±0.34 




±3.18 
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APPENDIX A 

HALO MODEL POWER SPECTRA 

Rewriting Equation 1 in terms of the concentration and the 
mass, we get 

Ps 



p(r,M) 



(rc/r v )- a (l+rc/r v ) 



3+a' 



where 



r 3 = 
'v 



PS = 



3M 
47tA v p ' 
A v pc 3 (M) 



c(M) 



d%- 



1 



2+a 



(Al) 



(A2) 



(A3) 



'(l+X) 3+a J 

p is the mean matter density and M is the mass of the halo. 
Since we will be working in wavenumber space when we gener- 
ate the power spectrum, we actually need to consider the Fourier 
transform of the halo profile, 



4%r 2 p(r,M) 



sin (At) 



dr, 



(A4) 



10 kr 
where we have normalized over mass so that y(0,M) = 1 and 
y(k > 0,AT) < 1. Note that this implies that p(r > r v ) = 0, trun- 
cating the mass integration at the virial radius. This condition 
can be relaxed, provided that one scales the halo mass appro- 
priately. 

With this in hand, we can move on to the next component 
of the halo model, the halo mass function, (dn/dM). The form 
of the mass function is given in Equation 7, but we need to 
properly normalize it by requiring that 

lJo^ MdM = J mdV = h (A5) 
for the dark matter distribution. On nonlinear scales, we expect 
the halos to cluster more strongly than the mass, and vice versa 
for linear scales (Mo & White, 1996). This means we need to 
positively bias the clustering of the high mass halos relative to 
the low mass halos. We can generate this sort of halo biasing 
scheme for the ST mass function using 



b(v) = 1 + - 



1 



2p 



5c - + wrvn- (A6) 

In order for the eventual power spectrum to reduce to a linear 
power spectrum on large scales, we need to impose the further 
constraint that 



r-oo 

/ f(v)b(v)dv 
Jo 



1, 



(A7) 



requiring that the biased halos with mass greater than M* be 
balanced out by anti-biased halos with mass less than . This 
integral is satisfied automatically if we use Equation A6 and 
have properly normalized /(v). 

Using just these three components, we can generate the power 
spectrum for the dark matter. However, in order to predict the 
galaxy power spectrum, we need to know how many galaxies 
are in a given halo (under the assumption that the distribution 
of galaxies in the halo follows the halo profile). These (7Y) rela- 
tions are given in Equations 8 and 9. The inclusion of galaxies 
does change the normalization of Equation A5 to 



I 



(N) 



■f(v)dv 



(A8) 



Jo M(v)- * ' p 
where n is the mean number of galaxies and p is the mean mat 
ter density at redshift z. 



On large scales, the power spectrum is dominated by correla- 
tions between galaxies in separate halos. We need to convolve 
the halo profile with the mass function to account for the fact 
that halos are not point-like objects. Since we are in Fourier 
space, we can perform the convolution using simple multiplica- 
tion. The halo-halo power (Pqq (&)) is then simply, 

1 2 



p r 

n Jo 



(AO 
Af(v) 



b(v)y(k,M)dv 



(A9) 



where P LIN (k) is the linear dark matter power spectrum, 

(8(k)8(k')) = (2n) 3 8(k-k')P L1N (k). (A10) 

For small scales, the dominant contribution to the power spec- 
trum comes from correlations between galaxies within the same 
halo. This single halo term is independent of k at larger scales, 
giving it a Poisson-like behavior. In order to account for the 
fact that a single galaxy within a halo does not correlate with it- 
self, we use the second moment of the galaxy number relation, 
(N(N - 1)), to calculate the Poisson power (PQ G (k)), 

= | rm mN M 1]) W,M)Uv. (All) 
n L Jo M 

Seljak (2000) takes £ = 2 for (N(N - 1)) > 1 and £ = 1 for 
(N(N — 1 )) < 1 ; this is done to account for the galaxy at the cen- 
ter of the halo in the limit of small number of galaxies. Adding 
PQ G (k) and Pqq (k), we recover the galaxy power spectrum at 
all wavenumbers, PggW- 



A. 1 . Calculating Subpopulation Power Spectra 

With the modifications to the mass distributions which go 
into red and blue galaxies given by Equations 13 and 14, we 
need to regenerate y(k,M) for each profile. We also need to re- 
normalize /(v) for each sub-population using Equation A8 to 
account for the differences in (N) and n: 

(N)r 



/„ 



f R (y)dv= — 

o M(v) JKy ' p 

o m Mv)dv =T 



(A12) 



Once this has been done, we can insert the above (along with 
the color-dependent halo profiles) into Equations A9 and All 
to generate the power spectra for red and blue galaxies: 



phh 



(*) 



Pm(k) 



r/ R (v; 

Jo 



M(v) 



b(y)y R (k,M)dv 



(A13) 



^w=p/;/ R( v)M^b R( .,M)i^ 



phh 



(k) 



PuN(k) 



PUk)=P 



iIo f ^B) Hv)MkM)dv 

/>b(v)^^ M ,,M)|W 
Jo «gM(v) 



As before, we generate the total power spectra (Prr(&) and 
^bb(^)) by taking the sum of these parts, 



^rr(£) 
Pm(k) 



P^(k)+PUk)- 



(A14) 
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APPENDIX B 

SUB-POPULATION SELECTION AND (N) PARAMETERS 

For all of the parameter values related to the red and blue 
galaxy HOD, we fit the relations in Equations 8, 9, and 1 1 to the 
galaxy catalogs produced by the GIF simulations. The details 
of the semi-analytic methods applied in the simulations can be 
found in Kaufmann et al. (1999), but we will briefly discuss 
some of the relevant features here. 

In the broadest strokes, semi-analytic methods like those ap- 
plied in the GIF simulations take the outputs of an N-body 
cosmological simulation at a number of time-steps, determine 
where galaxies will have formed based on some prescription 
and let the galaxies evolve from that point in time until the 
present. In the case of the GIF simulations, the N-body sim- 
ulations were generated using Hydra (Couchman, Thomas & 
Pearce, 1995), an adaptive particle-particle particle mesh code 
written as part of the VIRGO collaboration. Four different cos- 
mological models were used in the initial work, but, for the pur- 
poses of the calculations in this paper, we only used the ACDM 
outputs (Q. u = 0.3, Q. A = 0.7, h = 0.7, o 8 = 0.9) with the SDSS 
filters (not mentioned in Kaufmann et al.). These simulations 
are 141 hr l Mpc on a side and have a mass resolution on order 
IO^^Mq. 

At each time step used in the GIF simulations, a friends-of- 
friends group finding routine with linking length 0.2 was ap- 
plied to the N-body outputs and each group of 10 or more par- 
ticles was marked as a dark matter halo. The most bound mem- 
ber of each such group was flagged as the central galaxy of the 
halo. In subsequent time steps, previously determined halos 
are checked against mergers with other halos. In the case of a 
merger, the galaxy at the center of the most massive progenitor 
halo is considered to be at the center of the combined halo and 
the properties of that galaxy are transferred to the most bound 
particle of the new halo. Galaxies associated with less massive 



g -r 





1 i 1 i 

z = 0.06 i 

z = o.i3 : 

z = 0.27 ■ 

z = 0.35 _ 

■ - ■ - z = 0.42 ! 
z = 0.52 : 


■ , k.\ . i . i . i , i 


1 

1 ' 



0.25 0.5 0.75 1 1.25 



g "I 

FIG. 15. — Color distributions as a function of redshift using g' — r 1 (upper) 
and g' — { (lower). 



progenitor halos are now satellite galaxies of the new halo and 
remain associated with their original particles. In the GIF simu- 
lation outputs used for our calculations, each catalog contained 
- 90,000 halos and - 180,000 galaxies (of which - 35,000 
fell within our apparent magnitude cut). 

Once the positions of the individual galaxies within each halo 
have been set, the evolution of the stars in each galaxy can be 
determined. Even in the simplest terms, this requires a num- 
ber of considerations: availability of cool gas, star formation 
rate, supernovae feedback, initial stellar mass function, metal- 
licity (this is held at solar levels throughout the calculations), 
etc. In addition to these intra-galactic effects, there are also 
merger effects (combination of two satellite galaxies or in-fall 
of satellite galaxies into the central halo galaxy) and the as- 
sociated creation of galaxy bulges and star-burst activity. All 
of these processes require tuning to one degree or another in 
order to reasonably reproduce observed luminosity functions 
and Tully-Fisher relations. Since future SDSS angular cluster- 
ing measurements discussed here will combine both the galaxy 
evolution and clustering aspects of the model (at a variety of 
redshifts), they should serve as an excellent test for many as- 
pects of these treatments. 

In splitting the GIF simulation galaxy catalogs into red and 
blue samples, we had two primary considerations. First, we 
wanted to produce a selection method for the data that was ro- 
bust in segregating what appear to be two rather distinct sub- 
populations. Second, we wanted a criterion which could rea- 
sonably be applied to actual galaxies near the limit of our mag- 
nitude selection of r 1 = 21. The first of these requirements 
meant choosing a color cut that varied between the two pop- 
ulations slowly enough that passive evolution in galaxy colors 
over the extent of the redshift range was relatively small. The 
other requirement meant restricting ourselves to the g', r 1 and 
i' bands, as objects at the faint end of our magnitude cut will 
often fall below the detection threshold in u' and z' ■ 

The data sets we considered for this selection consisted of 
6 redshift epochs: z = 0.06, 0.13, 0.27, 0.35, 0.42 and 0.52. 
Figure 15 shows the distribution of rest-frame g' — r 1 and g' — i' 
colors for the galaxies at each of the redshift epochs. In both 
cases, the distribution is roughly bimodal, with a spike of very 
blue star-forming galaxies at g 1 ' — r 1 = g' — i' ~ 0. There is not 
an enormous difference between the two color distributions and 
it is clear that a simple straight line cut will select a slightly 
different population at higher redshift than at lower redshift. 
However, given the wider distribution of g' — i', we should suf- 
fer from less difference with redshift than with g' — r 1 . With this 
in mind, we split our sample at 

g'-z' = 0.85. (Bl) 

We can test the effectiveness of our color selection by look- 
ing at the variation of the (N) surfaces as a function of redshift. 
In Figure 16, we plot the surfaces for the whole galaxy sam- 
ple. As mentioned in §2, the overall galaxy surface appears rea- 
sonably static as a function of redshift and the sub-population 
surfaces behave similarly. We can also see that the shifting in 
apparent magnitude proceeds with redshift as we would expect. 

For a more specific look at the possible evolution of the (N) 
relations as it applies to the calculations in this paper, we can 
shift the surface from each redshift regime appropriately for the 
z ~ 0.3 selection function, apply our magnitude cut and com- 
pare the (N) curves. Figure 17 shows the (N) relations pro- 
duced by this method for the z = 0.06, 0.13, 0.27 and 0.35 
epochs. The (N)b curves show no significant signs of evo- 
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FIG. 16. — (N) as a function of halo virial mass and r' magnitude from the 
GIF simulations at z = 0.06, 0.27 and 0.52 (top to bottom). As in Figure 2, 
filled contours indicate log((A')) > 0, wire-frame contours indicate log((A')) < 
and successive contours a change of 0.25 in log((A')). 
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FIG. 17. — (N) relations for all galaxies (upper), red galaxies (middle) and 
blue galaxies (bottom). 



lution. There is some shift in the mass scales for (N)-r, both 
Mr and Mro, but the Yr is largely unchanged. In choosing our 
fiducial model, we fit the (N)r parameters from the z = 0.27 
epoch. While this behavior does make the modeling of the (N) 
relations more complicated than would be the case if the all the 
surfaces were static, the prediction of a red galaxy distribution 
evolving against a background of a static blue galaxy distribu- 
tion is intriguing. 

In addition to the (N) relations, we can also check the (N(N — 
1 )) relations for evolution. As one might expect from Figure 17, 
measurements of (N(N — 1))r from the simulations were very 
noisy and did not lend themselves to a reliable fit in the region 
where one expects strong deviation from a Poisson distribution. 
Rather, since we chose to use a universal 0Cm, we fit M a and 
Ga from Equation 11 using (N(N — 1)) and (N) measurement 
of the blue sub-sample. This approach may miss some of the 
evolution present in the red galaxy sample, but this seems un- 
avoidable. Unlike in the (N) measurements, more recent red- 
shift epochs did show stronger Poisson behavior at lower mass 
scales; M a stayed roughly fixed, but a a increased, leading to 
a more gradual decrease in 0Cm for lower mass halos. As with 
(N)r, the effect was not dramatic, but did result in roughly dou- 
bling a a from the z = 0.06 epoch from its value for the z = 0.35 
epoch. As before, we chose the parameter fits using the z = 0.27 
surfaces for our fiducial model. 



